Automated Template Discovery for Information Extraction from Biomedical Literature

نویسندگان

  • Satoshi Kamegai
  • Kenji Satou
چکیده

We propose a method to automatically extract templates from biomedical literature without background knowledge. The proposed method automatically extracts verbs and templates indicating interactions between biomolecules with a large dictionary called an extensional ontology. We applied our method to two datasets: one comprised 299 full texts from Cell (1998– 2002) and 13,818 entries from OMIM (Online Mendelian Inheritance in Man); the other included 33,622 abstracts from Medline (2002). Experimental results showed that our method could extract verbs and templates that had been manually collected in related works. For extracting templates, our method only needs to prepare ontology (or dictionary) and a large body of texts. Consequently, it can be applied to those of other fields as well as the biomedical literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Survey on Perception of People Regarding Utilization of Computer Science & Information Technology in Manipulation of Big Data, Disease Detection & Drug Discovery

this research explores the manipulation of biomedical big data and diseases detection using automated computing mechanisms. As efficient and cost effective way to discover disease and drug is important for a society so computer aided automated system is a must. This paper aims to understand the importance of computer aided automated system among the people. The analysis result from collected da...

متن کامل

LitLinker: A System for Searching Potential Discoveries in Biomedical Literature

The explosive growth in biomedical literature has made it difficult for researchers to keep up with advancements, even in their own narrow specializations. While researchers formulate new hypotheses to test, it is very important for them to identify connections to their work from other parts of the literature. However, the current volume of information has become a great barrier for this task, ...

متن کامل

Semi-Automated Semantic Annotation of the Biomedical Literature

Semantic annotations are a core enabler for efficient retrieval of relevant information in the life sciences as well in other disciplines. The biomedical literature is a major source of knowledge, which however is underutilized due to the lack of rich annotations that would allow automated knowledge discovery. We briefly describe the results of the SASEBio project (Semi Automated Semantic Enric...

متن کامل

Using statistical and knowledge-based approaches for literature-based discovery

The explosive growth in biomedical literature has made it difficult for researchers to keep up with advancements, even in their own narrow specializations. While researchers formulate new hypotheses to test, it is very important for them to identify connections to their work from other parts of the literature. However, the current volume of information has become a great barrier for this task a...

متن کامل

Enhancing a biomedical information extraction system with dictionary mining and context disambiguation

Journals and conference proceedings represent the dominant mechanisms for reporting new biomedical results. The unstructured nature of such publications makes it difficult to utilize data mining or automated knowledge discovery techniques. Annotation (or markup) of these unstructured documents represents the first step in making these documents machine-analyzable. Often, however, the use of sim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006